Improving Phenotype Name Recognition
نویسندگان
چکیده
Due to the rapidly increasing amount of biomedical literature, automatic processing of biomedical papers is extremely important. Named Entity Recognition (NER) in this type of writing has several difficulties. In this paper we present a system to find phenotype names in biomedical literature. The system is based on Metamap and makes use of the UMLS Metathesaurus and the Human Phenotype Ontology. From an initial basic system that uses only these preexisting tools, five rules that capture stylistic and linguistic properties of this type of literature are proposed to enhance the performance of our NER tool. The tool is tested on a small corpus and the results (precision 97.6% and recall 88.3%) demonstrate its performance.
منابع مشابه
Extracting Personal Names from Email: Applying Named Entity Recognition to Informal Text
There has been little prior work on Named Entity Recognition for ”informal” documents like email. We present two methods for improving performance of person name recognizers for email: emailspecific structural features and a recallenhancing method which exploits name repetition across multiple documents.
متن کاملEvaluation of the genetic test components
The sequencing of human genome leads to obtain an important data on genetic elements having a crucial role in the molecular pathology of genetic disorders. This is the reason of introducing genetic tests to medical field. Genetic testing looks for changes at chromosomes, genes and protein level to detect heritable conditions for clinical purposes. Genetic tests used in routine practice are va...
متن کاملName Searching and Information Retrieval
The main application of name searching has b e ~ name matching in a database of names. This paper discusses a different application: improving information retrieval through name recognition. It investigates name recognition accuracy, and the effect on retrieval performance of indexing and searching personal names differently from non-name terms in the context of ranked retrieval. The main concl...
متن کاملExtracting Personal Names from Email: Applying Named Entity Recognition to Informal Text
There has been little prior work on Named Entity Recognition for ”informal” documents like email. We present two methods for improving performance of person name recognizers for email: emailspecific structural features and a recallenhancing method which exploits name repetition across multiple documents.
متن کاملAbout improving recognition of spontaneously uttered French city-names
This paper deals with the recognition of French city-names over the telephone. This recognition task, critical in many applications, involves a 40,000 city-name vocabulary, ranging from short monosyllabic words to long official compoundnames. Data collected from a field experiment are analyzed, and several ways of improving speech recognition performance are investigated. This includes a carefu...
متن کامل